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Abstract 

Background: The ainn of this study was to develop a child-specific classification systenn for long bone fractures 
and to exannine its reliability and validity on the basis of a prospective nnulticentre study. 

Methods: Using the sequentially developed classification systenn, three sannples of between 30 and 185 paediatric 
linnb fractures fronn a pool of 2308 fractures docunnented in two nnulticenter studies were analysed in a blinded 
fashion by eight orthopaedic surgeons, on a total of 5 occasions. Intra- and interobserver reliability and accuracy 
were calculated. 

Results: The reliability innproved with successive sinnplification of the classification. The final version resulted in an 
overall interobserver agreennent of k, = 0.71 with no significant difference between experienced and less 
experienced raters. 

Conclusions: In conclusion, the evaluation of the newly proposed classification systenn resulted in a reliable and 
routinely applicable systenn, for which training in its proper use nnay further innprove the reliability. It can be 
reconnnnended as a useful tool for clinical practice and offers the option for developing treatnnent 
reconnnnendations and outconne predictions in the future. 



Background 

Classification systems are widely used in orthopaedic 
and trauma surgery. They play a key role in the report- 
ing of clinical and epidemiological data, allowing for 
uniform comparison and documentation of different 
conditions. They constitute the semantic basis of retro- 
spective and prospective clinical studies by providing a 
common language for defining and categorising pathol- 
ogy. This is becoming increasingly important in the 
implementation of quality control measures for diagnos- 
tic and therapeutic procedures. Therefore, a feasible and 
standardised form of documentation is required that is 
accessible for everyone and easy to use. 
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A useful classification system must be reliable and 
accurate before it can be considered valid [1,2]. Reliabil- 
ity reflects the precision of a classification system and in 
general refers to intraobserver and interobserver reliabil- 
ity. The intraobserver reliability describes the agreement 
between the ratings of one observer performing repeated 
classifications of a given entity, whereas the interobser- 
ver reliability describes the agreement between the rat- 
ings of different observers. Most of the classification 
studies use the Kappa coefficient introduced by Cohen 
[3] to quantify the agreement between raters. It distin- 
guishes true agreement between various observations 
from agreement due to chance alone, and is expressed 
as a value between -1 and 1. A Kappa value of -1.0 
means complete disagreement, 0.0 means chance agree- 
ment and 1.0, complete agreement. Different criteria are 
given in the literature for assessing the strength of 



© 201 1 Schneidmuller et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative 
Commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. 



Schneidmuller et al. BMC Musculoskeletal Disorders 201 1, 12:89 
http://www.biomedcentral.eom/1 471-2474/1 2/89 



Page 2 of 1 3 



agreement. The most widely adopted are those of Landis 
and Koch [4]. 

Classification accuracy is described using latent class 
modelling. The hypothesis is that each fracture belongs 
to one of several real clinically relevant classes, which 
may be theoretically defined, but not directly observable 
in practice. These classes are said to be "latent". The 
analysis aims to identif)^ the most likely number of these 
latent classes in the population, given the selected sam- 
ple of fractures and the agreement data collected among 
the various raters. For each class, the accuracy of classi- 
fication by each rater is estimated [5,6]. 

Numerous fracture classification systems have been 
proposed in orthopaedics [7-23]. Specific paediatric clas- 
sifications are less common. It does not seem appropri- 
ate to adopt a classification system created for adults for 
use in paediatric orthopaedics because certain child-spe- 
cific factors must be considered. The growing bone has 
the capability of spontaneous corrections of remaining 
deviations as well as the risk of growth disturbances. To 
date, only one child specific classification system for 
long bone fractures has been published [24,25]. 

The aim of this study was to develop a specific classi- 
fication system for paediatric long bone fractures 
together with a digital documentation system. The clas- 
sification is based on a preliminary version published in 
2000 [26-28], which has been further developed, 
improved and evaluated with respect to intraobserver 
and interobserver reliability and accuracy. 

Methods 

In the years 2003 and 2005 two prospective multicentre 
studies documenting a total of 2308 fractures were con- 
ducted in 13 paediatric trauma centres in Germany, Swit- 
zerland and Austria. All participants were active 
members of the Li-La paediatric expert group [29-31]. In 
each study hospital, all consecutively treated long bone 
fractures in children up to and including 16 years of age 
were assessed over a period of 3 months. The institu- 
tional review boards of the Universities of Bern, Switzer- 
land, and Giessen, Germany had approved the project. 

Demographic data such as sex and age, history and 
important clinical findings were collected with the 
MEMdoc documentation portal of the Institute for Eva- 
luative Research in Medicine of the University of Bern, 
Switzerland [32]. Primary and follow up x-rays were 
scanned, uploaded via the MEMdoc web interface and 
centrally stored with every patient record. To limit 
selection bias, all cases were included even if the quality 
of diagnostic images was not perfect. 

On the basis of the frequency distribution of fracture 
types in the data-set, 30 x-rays representing the most 
common fracture types were extracted from the pool of 
2308 for use in a pilot study. Typical radiographs were 



selected by 2 orthopaedic surgeons who were not asses- 
sors in the study. These fractures were assessed using 
the new classification system. Eight observers with dif- 
ferent levels of experience participated: three consultant 
surgeons specialised in paediatric trauma and five ortho- 
paedic residents. All raters were blinded to any informa- 
tion about the patient. The patient identification and 
the date on the films were hidden and each case was 
identified with a random number only. In a common 
rating session this series of 30 x-rays was studied and 
evaluated individually by each practitioner. 

On the basis of these results a sample size calculation 
was performed and an expanded group comprising 150 
cases (including the initial 30 pilot cases) was created 
from the pool of 2308 fractures in order to cover the 
complete spectrum of fracture types. These 150 cases 
were classified by the same observers, 6 months after 
the initial series of 30 cases. This allowed the evaluation 
of the inter- and intraobserver reliability in relation to 
the initial 30 cases. 

Following analysis of the results, a simplification of the 
classification system was introduced. This was evaluated 
by the same observers again rating the same 150 cases, 
randomly presented to them, after a further interval of 6 
months. 

For the last agreement study, a completely new frac- 
ture sample was selected that also included more cases 
of some previously underrepresented fracture types for 
which the classification system had been revised again. 
In this way, a new set of 185 fractures was compiled 
(Figure 1). 

In summary the development and validation process 
included a series of four formal agreement studies 
intended to allow for continual improvement of the 
classification system by reviewing the results, identifying 
specific flaws and subsequently adjusting the coding. 

Statistical analysis 

For the first classification session, sample size estimation 
was performed based on the 30 cases from the 2003 
multicentre study. These 30 cases were classified again 
by all 8 raters as part of the first classification session 
with the total 150 fractures. The interobserver reliability 
for those 30 cases was estimated using Kappa coeffi- 
cients to indicate the degree of agreement in ratings 
[33]. The last classification session was conducted with 
185 selected cases to guarantee a sufficient number of 
examples of the most important fracture types. The ana- 
lyses were performed for all raters stratified by experi- 
ence (senior and resident level). For the first letter of 
the classification code (Classification Dimension; GDI) 
all cases were used, for the second one (CD2) only the 
cases with agreement on GDI, for GD3 the cases with 
agreement on GDI and GD2 etc. Calculations were 
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LiLa MC1 & MC2 



2308 cases 



pilot study 



sample of 30 cases of the most frequent fractures of our data 
/. classification session 



sample size calculation 

(6 month difference) 



2nd study 



120 + (initial) 30 cases = 150 cases; intended representation 
of the complete fracture type spectrum 
2. classification session; 30 cases test-retest 



simplification 

(6 month difference) 



3rd study 



same 150 cases 
3. classification session 



simplification 



4th study 



new 185 cases; including underrepresented cases for classification 
4. classification session 



Figure 1 Flow chart of study history. 



done with the MAGREE macro of SAS (SAS Institute 
Inc., Gary, NG, USA), 

A gold standard was predefined by consensus amongst 
two independent senior surgeons. It was used for classifi- 
cation accuracy for each category by each rater 



(percentage of cases correctly classified) and checked by 
"Latent Glass Modelling" using the software latent 
GOLD® (Statistical Innovations Inc. Belmont, MA, USA). 

Over a timeframe of 6 months, two raters classified 
the final 185 fractures twice. For each of the two raters 
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the percent agreement between the first and the second 
ratings and the intraobserver Kappa coefficient were cal- 
culated. This was done for CDl, CDl-2, CDl-3 and 
CD 1-4. The mean agreement and mean kappa values for 
the two raters were calculated. 

Classification system 

The final classification code consists of five (optionally 
six) digits (Figure 2): 

1. According to the AO classification of long bone 
fractures in adults [34] the first digit represents the 
affected part of the upper or lower extremity: 

♦ 1 = humerus 

♦ 2 = forearm 

♦ 3 = femur 

♦ 4 = lower leg 

2. The second digit represents the bone segment 
where the fracture is located: 

♦ 1 = proximal (including epiphysis and metaphysis) 



♦ 2 = middle (diaphysis/shaft) 

♦ 3 = distal (including epiphysis and metaphysis). 
The metaphysis is defined by a square over the 
growth plate of the affected bone (Figure 2). 

3. Because of its therapeutic relevance the third digit 
indicates the assessor's decision as to whether it is an 
articular or non-articular (shaft) fracture. 

♦ All fractures affecting the articular surface, be it 
the epiphysis or the metaphysis (fractures of the ole- 
cranon), are considered to be articular; (a). 

♦ All fractures of the shaft and metaphysis are con- 
sidered to be non-articular: (s). 

4. The fourth digit specificies the morphology of the 
fracture type for articular and shaft fractures separately. 

♦ Articular fractures: 

♦ 1 = epiphyseal with wide open physis (Salter III) 

♦ 2 = epi-metaphyseal with wide open physis 
(Salter IV) 



LiLa Classification 

Version 2 



1. Position 

Localisation in the 
skeleton: 
(1-4) 



2. Position 

Localisation in the 
bone (segment): 
(1-3) 



3. Position 

IVIorphology: 
-articular (joint=a) 
-shaft (s) 



4. Position 

Specification of 
morphology: 
-joint (1-5) 
-shaft (1-5) 



5. Position 

Extent of 
displacement: 

- non-displaced (0) 

- tolerable 
displaced (1) 

- not tolerable (2) 



6. Position 

(exception): 
Non-supportive 
paired bone 
-Ulna (U) 
-Fibula (F) 



- for bones in pairs, the supportive bones will be classified as they are: Radius or Tibia 

- if the non-supportive bone is to be classified it will be done so in the 6th position as a U or F for the Ulna 
and Fibula respectively (such as for exannple joint fx of the Olecranon) 

- the nnetaphysis will be defined by the square over the corresponding growth plate (nneasured with a 
connpass fronn each edge of the growth plate) 











4 









epidenniologically less important joint lesions (0-1% of all fractures) are classified as 5=others (as they are: 
joint fx of the proxinnal Hunnerus, the proxinnal Radius, the proxinnal Ulna, the distal Radius, the distal Ulna 
and the proxinnal Fennur) 

Figure 2 Overall structure of the Li-La Classification of paediatric fractures of long bones. 
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♦ 3 = epiphyseal with beginning physiological 
closure of the plate in adolescents (two-plane/ 
Tilleaux fracture) 

♦ 4 = epi-metaphyseal with beginning physiologi- 
cal closure of the plate in adolescents (tri-plane 
fracture). 

♦ 5 = statistically less important joint lesions are 
subsumed as 5 = others; e.g. intraarticular liga- 
ment avulsions and flake fractures. 

♦ Non-articular/shaft fractures: 

♦ 1 = they start with the most peripherical meta- 
physeal fracture; the epiphyseal separation with 
or without metaphyseal wedge (Salter I and II) 

♦ 2 = metaphyseal greenstick or buckle fractures 
and greenstick or bowing fractures of the shaft 

♦ 3 = all complete fractures including transverse, 
oblique and torsion fractures 

♦ 4 = multifragment fractures 

♦ 5 = statistically less important shaft lesions are 
subsumed as 5 = others; e.g. extra-articular liga- 
ment avulsions. 

5. A fifth optional digit was introduced to divide the 
fracture displacement into 

♦ 0 = non-displaced 

♦ 1 = tolerable displacement 

♦ 2 = intolerable displacement 

to indicate the likelihood of spontaneous correction of 
displacement by further growth. Tolerable displacement 
indicates displacement that is reliably known to either 
correct itself spontaneously during further growth or, in 
case it persists, to have no clinically relevant functional 
or cosmetic consequences. To date this is still an indivi- 
dual, subjective decision. Provisionally, a fracture gap 
greater than 2 mm is considered to represent displace- 
ment in all epiphyseal fractures [35]. 

6. The sixth digit helps to specify the fractures of 
paired bones (forearm and lower leg). In general, the 
supportive bone is classified as it is: Radius for the fore- 
arm and Tibia for the lower leg. If the other bone is 
affected and needs special description, for example with 
a fracture of the proximal Ulna, isolated fracture of the 
ulna or fibula, U will be used for ulna and F for fibula. 

There is only one exception to this classification pat- 
tern. Because of their frequency and peculiarities in frac- 
ture healing and possible complications, fractures of the 
distal humerus received a separate designation 

♦ 1 = fracture of the radial condyle 

♦ 2 = Y-fracture 

♦ 3 = fracture of the ulnar condyle 



An overview of the classification system is given in 
Figures 3 and 4. An example is provided in Figure 5. 

Results 

The overall case pool that was included in the develop- 
ment of the classification system comprised 2308 frac- 
tures. Male patients were slightly overrepresented with 
56.8%. The risk of having a fracture before termination 
of growth was 1.2-1.6-fold higher in males. The average 
overall age of the patients was 8.1 years. The main loca- 
lisation of fracture was the forearm (54.1%), followed by 
the humerus (20.3%), the lower leg (20.4%) and the 
femur (5.2%). 2/3 of all fractures involved the metaphy- 
sis (65.1%), whereas fractures of the diaphysis occurred 
in 24.8% and fractures of the epiphysis in 8.1% of all 
cases. Most fractures occurred as a result of sports- 
related injuries (38.5%), followed by domestic accidents 
(23.0%) and playground accidents (19.9%) [30]. 

Intraobserver agreements 

Intraobserver agreement was determined with the 30 
cases used for sample size calculation in the very first 
agreement study and with the 185 cases of the final 
study. In the first series, there was test-retest agreement 
in 96% of cases for the first two dimensions, in 91.4% of 
cases for the first three dimensions, in 89.1% of cases 
for the first four dimensions, in 74.7% of cases for the 
first five dimensions and in 19.6% of cases for all six 
dimensions. This equated to Kappa values ranging from 
0.97 to 0.57. In the final version there was test-retest 
agreement in 97% of cases for the first two dimensions, 
in 97% of cases for the first three dimensions, and in 
87% of cases for the first four dimensions. This equated 
to Kappa values ranging from 0.99 to 0.86. 

Interobserver agreement 

The overall interobserver reliability of the initial classifi- 
cation was K = 0.58. Different Kappa values were found 
for the single dimensions. Assessing the localisation in 
the skeleton (CDl) and the paired bone (CD 6) showed 
the best agreement (localisation in skeleton k = 0.99, 
localisation in bone k = 0.91 and paired bone k = 0.99), 
whereas there was less agreement in assigning the child- 
specific fracture code (CD 4) with k = 0.66. Classifica- 
tion of the segment (CD 2 - metaphysis, epiphysis, dia- 
physis) showed only weak agreement k = 0.33. 

The only moderate agreement in the initial version 
was largely explained by the difficulty in distinguishing 
the metaphysis from the diaphysis, the greenstick from 
the buckle fracture and the transverse from the oblique 
diaphyseal fracture. Due to a lack of therapeutic rele- 
vance, e.g. their requirement for similar or identical 
treatment, some fracture types (e.g. metaphyseal 
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LiLa Classification 

overview joint fractures 



1.1.a.5.0-2.:otliers 

Almost no joint fx, flakes and avulsion fx of the 
tubercule are subsumed under others (=5) 


Proximal Humerus 


1.3.a.1-5.0-2.: 

1 . fx of the radial condyle 

2. Y-fx 

3. fx of the ulnar condyle 

4. others 




2.1.a.5.0-2.: others 

Almost no joint fractures, transitional fx and chisel fx 
in adolescents are subsumed under other (=5) 


Proximal Radius 


2.1. a.5.0-2.U: others 

Almost no joint fx. Olecranon fx and fx of the 
processus coronoideus are subsumed under others 
(=5) 




2.3.a.5.0-2.: others 

Almost no joint fx, typical epiphyseal fx and 
transitional fx are subsumed under others (=5) 


Distal Radius 


3 1 a 5 0-2 ■ others 

Almost no joint fx, flakes are subsumed under others 
(=5) 


PrnYimal Fpmiir 


3.3.a.1-5.0-2. 

1 . epiphyseal (Salter III) fx at open growth plates 

2. epi-metaphyseal (Salter IV) fx at open growth 
plates 

3. epiphyseal (two plane) fx at beginning closure of 
the growth plates 

4. epi-metaphyseal (triplane) fx at beginning closure 
of the growth plates 

5. others 


12 3 4 


4.1.a.1-5.0-2. 

1 . epiphyseal (Salter III) fx at open growth plates 

2. epi-metaphyseal (Salter IV) fx at open growth 
plates 

3. epiphyseal (two plane) fx at beginning closure of 
the growth plates 

4. epi-metaphyseal (triplane) fx at beginning closure 
of the growth plates 

5. others 


12 3 4 


4.3.a.1-5.0-2. 

1 . epiphyseal (Salter III) fx at open growth plates 

2. epi-metaphyseal (Salter IV) fx at open growth 
plates 

3. epiphyseal (two plane) fx at beginning closure of 
the growth plates 

4. epi-metaphyseal (triplane) fx at beginning closure 
of the growth plates 

5. others 


£ s r 



Figure 3 Overview of Li-La classification of paediatric long bone fractures: articular fractures. 
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LiLa Classification 

overview shaft fractures 



1.1-3.S.1-5.0-2. 

1 . epiphyseal separation with and without 

metapyhseal wedge (Salter 1 and II) 

2. metaphyseal buckle fx and metaphyseal greenstick 
fx / diaphyseal greenstick fx 

3. transverse, oblique and torsion fx 

4. multifragmentfx 

5. others 




[ M 


V 

1 r 


3 

) 


6 


2.1-3.S.1-5.0-2. 

1 . epiphyseal separation with and without 
metapyhseal wedge (Salter 1 and II) 

2. metaphyseal buckle fx and metaphyseal greenstick 
fx / diaphyseal greenstick fx 

3. transverse, oblique and torsion fx 

4. multifragmentfx 

5. others 


i 

c 




LX 


3 4 

i I 


i 

\ 
1 

1 

\ 


3.1-3.S.1-5.0-2. 

1 . epiphyseal separation with and without 

metapyhseal wedge (Salter 1 and II) 

2. metaphyseal buckle fx and metaphyseal greenstick 
fx / diaphyseal greenstick fx 

3. transverse, oblique and torsion fx 

4. multifragmentfx 

5. others 


\ 


Li 




'f 




4.1-3.S.1-5.0-2. 

1 . epiphyseal separation with and without 
metapyhseal wedge (Salter 1 and II) 

2. metaphyseal buckle fx and metaphyseal greenstick 
fx / diaphyseal greenstick fx 

3. transverse, oblique and torsion fx 

4. multifragmentfx 

5. others 


li 




r 


w 

r 

c 

/' 


4 

\ 



Figure 4 Overview of Li-La classification of paediatric long bone fractures: shaft fractures. 

V ) 



greenstick and buckle fracture) were subsumed in one 
group and the square over the physis was introduced to 
differentiate the distal part from the middle, i.e. the 
shaft. Indeed, this simplification resulted in an improve- 
ment in the agreement in ratings for the subsequent 
version of the classification system. Results for each 
dimension are based on all cases with agreement in the 
preceding dimension. Those cases with disagreement in 
the preceding dimension were not considered. 



After analysing the problems with the initial version in 
the first 3 series, the classification system was modified 
in the final series and then re-evaluated. 

♦ Dimension 1: no change was made 

♦ Dimension 2 (localisation in bone: segment): 
assigning the fracture localisation in the bone to a 
distal or proximal part, including the epiphysis and 
metaphysis, and a diaphysial part by defining the 
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Figure 5 A buckle fracture of the distal radius. The classification 
is determined as follows: localization in the skeleton - radius = 2; 
localization in the bone - metaphysis (square rule) = 3; morphology 
- shaft = s; fracture type - buckle fracture = 2; displacement - non- 
displaced = 0. Code: 2.3.S.2.O. 

v J 



metaphysis with a square over the physis improved 
agreement from k; = 0.33 in the initial version to k; = 
0.89 in the final version (177 of 185 cases applicable; 
Table 1). 

♦ Dimension 3 (morphology): distribution of frac- 
tures according to articular involvement. The overall 
Kappa coefficient was k; = 0.88 (141 of 185 cases 
applicable; Table 2). The accuracy of classification of 
articular and shaft fractures for the multicenter 
study are shown in Table 3. 



Table 1 Summary of agreement und Kappa values for CD 
2 = localisation in the bone 



CD2 


1 


2 


3 


all 


Agreement 


17 


39 


85 


141 


Total cases 


20 


49 


108 


177 


Kappa all 


0.95 


0.86 


0.90 


0.89 


Kappa senior 


0.95 


0.92 


0.96 


0.94 


Kappa junior 


0.95 


0.84 


0.87 


0.87 



proximal segment, ^diaphysis, ^distal segment 



Table 2 Summary of agreement und Kappa values for CD 
3 = morphology 



CD3 


A 


S 


all 


Agreement 


11 


116 


125 


Total cases 


18 


123 


141 


Kappa all 


0.90 


0.89 


0.88 


Kappa senior 


0.98 


0.96 


0.96 


Kappa junior 


0.85 


0.85 


0.84 


Full rater agreement (latent class) 




88.7% 





'articular, ^non-articular/shaft 



♦ Dimension 4: after subsuming fractures with the 
same therapeutic consequence in one group, specifi- 
cation of the child-specific morphology of the frac- 
ture resulted in a mean Kappa coefficient of k; = 0.72 
(127 of 185 cases applicable; Table 4). Agreement 
separated by fracture type (epiphysis, metaphysis, 
diaphysis) ranged from k; = 0.59-0.92 in the multi- 
center study (Table 5). 

♦ Dimension 5 (optional): all fractures were classified 
according to their subjective prognosis and 



Table 3 Accuracy of classification of articular and shaft 
fractures (A/S). 



Likely fracture type 


A (n = 18) 


S (n = 123) 


Fracture class distribution 


12.8% 


87.2% 


Rater 1 ^ 






A 


94.4% 


0% 


S 


5.6% 


100% 


Rater 2^ 






A 


100% 


0% 


S 


0% 


100% 


Rater 3^ 






A 


100% 


0% 


S 


0% 


100% 


Rater 4^ 






A 


94.4% 


0% 


S 


5.6% 


100% 


Rater 5^ 






A 


94.4% 


0.8% 


S 


5.6% 


99.2% 


Rater 6^ 






A 


88.9% 


3.3% 


S 


11.1% 


96.7% 


Rater 7^ 






A 


83.3% 


0% 


S 


16.7% 


100% 


Rater 8^ 






A 


100% 


0% 


S 


0% 


100% 



Fracture class distribution and probability of correct classification for each 
fracture class and rater, 
^senior rater, ^junior rater 
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Table 4 Summary of agreement und Kappa values for 
CD4 = child-specific fracture code 



CD4 


1 


2 


3 


4 


5 


all 


Agreement 


14 


25 


19 


0 


3 


61 


Total cases 


22 


50 


49 


2 


4 


127 


Kappa all 


0.79 


0.70 


0.69 


0.43 


0.94 


0.72 


Kappa senior 


0.86 


0.67 


0.57 


0.27 


0.91 


0.68 


Kappa junior 


0.76 


0.74 


0.77 


0.54 


0.95 


0.76 



^definition of CD4 outcomes 1-5 depends on articular or non-articular/shaft 
fracture 



therapeutic relevance as non-displaced (0), displaced 
but tolerable (1) and displaced and intolerable (2). 
Table 6 shows the Kappa coefficients for these. The 
results were not so favourable with a mean Kappa of 
k; = 0.61. Subsuming the undisplaced (0) and the tol- 
erable (1) fractures because of lack of therapeutic 
relevance resulted in a mean Kappa of k; = 0.83 
(Table 7) (61 of 185 cases applicable). 

The final version resulted in an overall interobserver 
agreement of ^ = 0.71 for the dimensions CD 1-4. 
There was no significant difference in k values between 
experienced {n = 3, k = 0.73) and less experienced (n = 
5, ^ = 0.72) raters. There was perfect agreement 
between the gold standard and the classification based 



on latent class modelling for CDl, CD2 and CD3. For 
CD4 and CD5 there were some minor differences. 

Discussion 

Although many classification systems have been widely 
adopted and frequently used in orthopaedic surgery, few 
have been scientifically tested for their reliability. Those 
that have been evaluated show generally low reliability 
but they are nonetheless still in common use. 

Considering the differing methodologies used in differ- 
ent studies, it is difficult to interpret the reported Kappa 
values with confidence. Our results indicated good relia- 
bility for dimensions CD 1-4 with an overall Kappa 
value of 0.71 for a group of clinicians who are interested 
in the topic; the values were not dependent on surgical 
experience. The majority of other studies reported lower 
levels of agreement (Table 8). One exception is the 
assessment of supracondylar fractures of the distal 
humerus using a modified Gartland classification [36], 
which showed an interobserver reliability of k = 0.74 
and an intraobserver reliability of k = 0.81-0.84. Simi- 
larly, an assessment of tibial plateau fractures according 
to the Schatzker classification, and based on conven- 
tional x-rays and MRI scans, revealed an interobserver 
agreement of ^ = 0.85 [23]. The AO paediatric classifi- 
cation shows Kappa coefficients for diagnosis of specific 
child patterns of 0.51, 0.63, and 0.48 for epiphyseal. 



Table 5 Overall assessment (dimensions CD1 


- 4) (127 cases applicable, agreement in dimensions 1-3 must have been 


achieved) 










Localisation in skeleton Localisation in bone 


Articular/shaft 


Child specific fracture code 


N 


Kappa 


1,2,3,4 1 and 3 


a 


1 (Salter 1) 


8 


0.83 


(Epiphysis) 














2 (Salter II) 










3 (Two plane) 










4 (Triplane) 










5 (others) 


3 


0.90 






All 


n 


0.79 


1,2,3,4 1 and 3 


s 


1 (epiphysiolysis) 


14 


0.72 


(Metaphysis) 














2 (incomplete Fx) 


40 


0.71 






3 (complete Fx) 


23 


0.64 






4 (multifragment Fx) 










5 (other) 


1 


n.a 






All 


78 


0.69 


1,2,3,4 2 


s 


1 non existent 






(Diaphysis) 














2 (incomplete Fx) 


10 


0.59 






3 (complete Fx) 


26 


0.60 






4 (multifragment Fx) 


2 


n.a 






5 (other) 










All 


38 


0.58 
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Table 6 Summary of agreement und Kappa values for 
CDS = displacement 



CDS 


0 


1 


2 


all 


Agreement 


3 


1 


21 


25 


Total cases 


16 


15 


30 


61 


Kappa all 


0.59 


0.30 


0.83 


0.61 


Kappa senior 


0.73 


0.24 


0.80 


0.65 


Kappa junior 


0.47 


0.29 


0.83 


0.56 



°not displaced, Misplaced but tolerable, Misplaced not tolerable 



metaphyseal, and diaphyseal fractures, respectively. The 
moderate Kappa values in our initial studies were largely 
explained by the difficulty in distinguishing the meta- 
physis from the diaphysis, the greenstick from the 
buckle fracture and the tranverse from the oblique dia- 
physeal fracture. As explained earlier, this classification 
was simplified because of its lack of therapeutic rele- 
vance. The metaphyseal buckle and greenstick fractures 
of the distal radius, for example, require exactly the 
same treatment, namely cast immobilisation [37]. Thus, 
discrimination between these two metapyhseal fracture 
types is not relevant and the simplification resulted in 
an improvement in the Kappa values for the interobser- 
ver reliability. 

The optional fifth digit, which indicates a tolerable or 
non-tolerable dislocation, resulted in good interobserver 
agreement = 0.83) if the non-displaced and the dis- 
placed but tolerable fractures were interpreted as one 
and the same class. The definition of displaced but tol- 
erable and displaced and not tolerable fractures is cur- 
rently based on the knowledge in the literature and 
enhances the clinical relevance substantially. In such a 
simplified mode, the fifth digit could be used in further 
studies for evaluating guidance for treatment. 

It has been suggested that a useful classification sys- 
tem must be hierarchical to offer guidance in determin- 
ing the optimal treatment method and to indicate the 
prognosis for a particular condition [34,38-40]. In con- 
trast to adult classifications, a hierarchical order for the 
paediatric fracture types (by severity, diagnostic or ther- 
apeutic management, or prognosis) is not possible or 
advisable because these parameters are influenced by 



Table 7 Summary agreement und Kappa values for CDS 
= displacement subsuming the undisplaced and the 
displaced but tolerable fractures 



CDS 


0/1 


2 


all 


Agreement 


25 


21 


46 


Total cases 


31 


30 


61 


Kappa 


0.84 


0.83 


0.83 


Kappa senior 


0.80 


0.80 


0.80 


Kappa junior 


0.84 


0.83 


0.83 



°not displaced, Misplaced but tolerable, Misplaced not tolerable 



many different factors. The injury pattern of children is 
stereotypical and seems to be much more dependent on 
the maturation stage of the physis than on the injury 
mechanism. This is why complicated articular fractures, 
as seen in adults, are not found in children as long as 
the epiphyseal plate is still wide open. Besides factors 
such as fracture localisation and extent of displacement, 
the choice of treatment is mainly influenced by the 
patient's age, since the prognosis for growth depends on 
this. It is also influenced by the growth plates and their 
maturity. Hence only a classification without hierarchies, 
which follows the neutral aspects of localisation and 
morphology, is useful in describing fractures in children. 
These non-hierarchical classifications mostly describe 
specific fractures of single localisations [36,41,42]. 

To our knowledge only one classification system of 
paediatric long bone fractures has been proposed to 
date. Its development and evaluation by the AO Paedia- 
tric expert group [24,25] proceeded at approximately the 
same time as the one presented in this paper. Hence, 
there are some similarities, but there are also important 
differences: 

♦ The main distinction concerns the precise separation 
of the intraarticular fractures from the fractures not 
involving the articular surface. The AO system classifies 
separation of the physis as an articular fracture. How- 
ever, a separation of the physis with or without meta- 
physeal wedge, generally known as Salter I and II 
fractures, does not involve the articular surface. These 
fractures are considered as the most peripheral shaft 
fractures of long bones. Thus, they have a different 
prognosis and need to be treated differently. In our opi- 
nion this aspect must be clearly considered in a paedia- 
tric classification system, which will ultimately be used 
to develop treatment guidelines and prognostic 
predictors. 

♦ It has been shown that the simpler the fracture clas- 
sification, the better its reliability [10,12,15,43]. For 
these reasons we tried to simplify our classification sys- 
tem to the necessary minimum. All infrequent lesions 
(0-1% of all fractures) were subsumed in one category. 
The only exception was the articular fracture of the dis- 
tal humerus, due to its importance. In contrast, the AO 
classification [24,25] includes different exceptions and 
additional codes, e.g. for supracondylar fractures, and 
fractures of the radial head or the proximal femur. 

Conclusions 

In conclusion, we have developed a paediatric classifica- 
tion system for fractures of the long bones, which has 
been shown to have good reliability. This classification 
system also accommodates determination of clinical 
consequences and hence surpasses the simple descrip- 
tion and definition of fractures. We therefore propose 
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Table 8 Examples of reliability of classification systems 



Author 



Classification 



Interobserver k 



Intraobserver k 



Humerus 

Sidor et al. 1993 [33] Neer (prox humerus) 

Siebenrock, Gerber 1993 [34] Neer (prox. humerus) 
AO/ASIF 



0.48 + 0.52 
0.40 

0.53 (types) 
0.42 (groups) 



0.66 
0.60 

0.58 (types) 
0.48 (groups) 



Sjoden et al. 1999 [35] 
Barton et al. 2001 [6] 
Radius 

Kreder et al. 1996 [19] 
lllarramendi et al. 1998 [16] 

Flinkkila et al. 1998 [14] 



Muller AO (prox. humerus) 
Neer 

mod. Gartland (paediatric fracture of the dist. humerus) 



Muller AO (dist. radius) 
Muller AO (dist. radius) 
Frykman 

Muller AO (dist. radius) 



0.32-0.34 
0.44-0.49 
0.74 



0.33-0.68 

0.37 

0.43 

0.18 (x-ray) 
0.16 (x-ray + CT) 



0.29-0.74 
0.27-0.73 
0.81-0.84 



0.25-0.42 

0.57 

0.61 



Femur 

Andersen et al. 1990 [2] 
Pervez et al. 2002 [28] 



Bj0rgul, Reikeras 2002 [8] 
Schipper et al. 2001 [30] 



Evans (prox. femur) 
Jensen (prox Femur) 
AO 

Garden (femoral neck) 
AO/ASIF (prox. femur) 



0.38-0.69 

0.34 

0.33 

0.62 (without subgroups) 
0.4 

0.33-0.34 



0.56-0.67 

0.34 

0.42 

0.71 (without subgroups) 
0.48 



Lower leg 

Thomsen et al. 1991 [40] 
Martin et al. 1997 [23] 



Swiontkowski et al. 1997 [39] 
Yacoubian et al. 2002 [43] 



Lauge-Hansen (dist. tibia) 
Weber 

Ruedi und Allgower (dist tibia) 
AO/ASIF 

AO/OTA (dist. tibia) 

Muller-AO, Schatzker (tibial plateau) 



0.49 + 0.60 
0.58 + 0.56 
0.46 

0.60 (types) 
0.38 (groups) 
0.49-0.58 
0.68 (x-ray) 
0.73 (+ 
0.85 (+ MRI) 



0.60-0.70 
0.60-0.76 
0.55 

0.70 (types) 
0.48 (groups) 



Other 

Ward et al. 1997 [42] 
McAdams et al. 2002 [24] 



Severin (hip) 
Scapular neck 



0.16 

0.30 (weighted) - 0.81 



0.34 

0.49 (weighted) 



Li-La 



All long bones and localisations 



0.71 



0.78-0.93 



use of this classification system in future prospective 
studies including those examining the relevance of ther- 
apeutic measures. The latter should include evaluation 
of the minimum necessary diagnostic and therapeutic 
procedures leading to an optimum outcome. 
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